A Comparison of Some Morphological Filters for Improving OCR Performance

نویسندگان

  • Laurent Mennillo
  • Jean Cousty
  • Laurent Najman
چکیده

Studying discrete space representations has recently lead to the development of novel morphological operators. To date, there has been no study evaluating the performances of those novel operators with respect to a specific application. This article compares the capability of several morphological operators, both old and new, to improve OCR performance when used as preprocessing filters. We design an experiment using the Tesseract OCR engine on binary images degraded with a realistic document-dedicated noise model. We assess the performances of some morphological filters acting in complex, graph and vertex spaces, including the area filters. This experiment reveals the good overall performance of complex and graph filters. MSE measures have also been performed to evaluate the denoising capability of these filters, which again confirms the performances of both complex and graph filtering on this aspect.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Morphological filters for OCR: a performance comparison

In this article is compared the ability of several morphological operators to improve OCR performance when used as preprocessing filters. An experiment on binary and greyscale images using the Tesseract OCR engine and morphological filters acting in complex, graph and vertex spaces has thus been performed and results in a good overall performance of complex and area filters. MSE measures have a...

متن کامل

Improving the Performance of ICA Algorithm for fMRI Simulated Data Analysis Using Temporal and Spatial Filters in the Preprocessing Phase

Introduction: The accuracy of analyzing Functional MRI (fMRI) data is usually decreases in the presence of noise and artifact sources. A common solution in for analyzing fMRI data having high noise is to use suitable preprocessing methods with the aim of data denoising. Some effects of preprocessing methods on the parametric methods such as general linear model (GLM) have previously been evalua...

متن کامل

Document image restoration using binary morphological filters

This paper discusses a method for binary morphological lter design to restore document images degraded by subtractive or additive noise, given a constraint on the size of lters. With a lter size restriction (for example 3 3), each pixel in output image depends only on its (3 3) neighborhood of input image. Therefore, we can construct a look-up table between input and output. Each output image p...

متن کامل

Improving OCR Performance in Biomedical Literature Retrieval through Preprocessing and Postprocessing

Today’s information retrieval (IR) techniques are mostly text-based. As a consequence, some types of information are beyond the reach of text-based IR systems, which fail in situations where textual information can not be easily accessed, e.g. textual information in biomedical images and figures. To tackle such situations, we propose to augment IR systems with the ability to perform optical cha...

متن کامل

Comparison of the performance of plant nano-hydrogels as bio-filters for nitrite uptake from effluent of fish farms

The aim of this study was to evaluate the performance of plant nano-gels as a new adsorbent and to remove nitrite at low cost. At first, plant nano-gels (nano bagasse, chitosan-functionalized nano-fiber and lignocellulose nano-fiber) were prepared by mechanical and top-down mechanism. Batch system medium was designed to measure the adsorbent optimum. Then isotherm and synthetic were calculated ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015